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Abstract 

A central assumption in our analysis of cosmic structure is that cosmological perturbations have 
zero ensemble mean. This property is one of the consequences of statistically homogeneity, the 
invariance of correlation functions under spatial translations. In this article we explore whether 
cosmological perturbations indeed have zero mean, and thus test one aspect of statistical homo- 
geneity. We carry out a classical test of the zero mean hypothesis against a class of alternatives 
in which perturbations have non-vanishing means, but homogeneous and isotropic covariances. 
Apart from Gaussianity, our test does not make any additional assumptions about the nature of 
the perturbations and is thus rather generic and model-independent. The test statistic we employ 
is essentially Student's t statistic, applied to appropriately masked, foreground-cleaned cosmic mi- 
crowave background anisotropy maps produced by the WMAP mission. We find evidence for a 
non-zero mean in a particular range of multipoles, but the evidence against the zero mean hypoth- 
esis goes away when we correct for multiple testing. We also place constraints on the mean of the 
temperature multipoles as a function of angular scale. On angular scales smaller than four degrees, 
a non-zero mean has to be at least an order of magnitude smaller than the standard deviation of 
the temperature anisotropics. 



I. INTRODUCTION 



The cosmological principle is one of the cornerstones of modern cosmology. Roughly 
speaking, the principle states that the universe is homogeneous and isotropic on large scales. 
Although large-scale homogeneity and isotropy were initially postulated, in recent decades 
the principle has received mounting experimental support, and today there is little doubt 
about its validity. The cosmological principle has a slightly preciser formulation, which states 
that a perturbed Friedman- Robertson- Walker metric provides an accurate description of the 
universe. Thus, according to the cosmological principle, the universe is well-described by 
the perturbed spacetime metric 

ds^ = a\r)) [-(1 + 2$(i, x))drj^ + (1 - 2^{t, x))dx^] , (1) 

with sufficiently small (scalar) perturbations at long wavelengths 

(2) 

But apart from that, the principle has nothing to say about the properties of these pertur- 
bations. 

Many of the advances in modern cosmology consist in the characterization of the metric 
perturbations $(x) and ^(x). Though not often explicitly emphasized, one of the key 
assumptions is that these perturbations are just a particular realization of a random process 
in a statistical ensemble. Hence, we do not really try to describe the actual perturbations 
x) and \E'(t, x); our goal is to characterize the statistical properties of the random fields 
$(t,f) and ^(t,f). 

Let x) denote any random field in the universe, such as the metric perturbations 
considered above, or the energy density of any of the components of our universe. The 
statistical properties of the random field are uniquely specified by its probability distribution 
functional. It turns to be simpler however to study the moments of the field {(f){xi) ■ ■ • , 
where (• • • ) denotes ensemble average, and all the fields are evaluated at a common but 
arbitrary time t, which we suppress for simplicity. 

The cosmological principle has formal counterparts in the properties of the perturbations, 
though, as we emphasized above, the cosmological principle itself only requires the actual 
perturbations in our universe to be small. We say that a random field is statistically 
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homogeneous (or stationary), if all its moments are invariant under translations, 

= (0(fi + f)---0(x„ + f)), VfGR^Vr^GN. (3) 

In some cases, statistical homogeneity may apply only to some field moments. The random 
field is stationary in the mean if 

(0(f)) = +f)) vfeR^ (4) 

and it is stationary in the variance if 

'A0(xi) A0(f2)) = (A0(fi + f ) A4>{x2 + f)) Vf G R^ (5) 



where we have defined A0 = 0— (0). If the random field is Gaussian, the one- and two-point 
functions uniquely determine all the remaining moments of the field. A Gaussian random 
field stationary in the mean and in the variance is hence automatically fully stationary. 

Parallel definitions apply to the properties of the perturbations under rotations. In par- 
ticular, we say that a random field is isotropic in the mean if 

(0(x)) = (0(o + i?- (x-o))) Vo G R^ Vi? G S0(3). (6) 

Analogously, a random field is isotropic in the variance if 

^A0(fi) A0(x2)^ = (^A(f){d+R-{xi-d)) A0(o + i?- (x2 -o))^ Vo G R^ Vi? G 50(3). 

(7) 

Since there is always a rotation that maps x to x -|- T, and because any two points related 
by a rotation always differ by a translation, equations ^ and ^ imply that homogeneity 
and isotropy in the mean are equivalent. But homogeneity in the variance does not imply 
isotropy in the variance, though the converse is true pQ, 

Isotropy in the variance =^ Homogeneity in the variance. (8) 

Homogeneity and isotropy in the mean have an important consequence: Equations ^ or 
(|6| immediately imply that the expectation of a stationary field is constant, 

(0(x)) = const, (9) 

and, conversely, any random field with constant mean is homogeneous and isotropic in the 
mean. Because, by definition, cosmological perturbations always represent deviations from 



a homogeneous and isotropic background, it is then always possible to assume that the 
constant value of their mean is zero, if they happen to be stationary. For example, in 
perturbation theory we write the total energy density p as a background value po plus a 
perturbation 5p, 

p = Po{t) + 6p{t,x). (10) 

This split into a background value and a perturbation is essentially ambiguous, unless we 
specify what the background actually is. In cosmology, what sets the background apart 
from the perturbations is symmetry. Because of the cosmological principle, the background 
energy density po is defined to be homogeneous. Hence, if the constant mean of the stationary 
random field 6p is not zero, we may redefine our background and perturbations by 

Po Po = Po + (5p), 5p ^ 5p = 5p- {5p), (11) 

without affecting the overall value of the energy density, p p = p. In this case the redefined 
perturbation 6p has zero mean, while the redefined background po is still space- independent. 

It is important to recognize that cosmological perturbations can be assumed to have zero 
mean if and only if their mean is a constant. Consider again the example of the energy 



density (10), but now assume that Sp is not stationary. Although the redefinitions (11) 
allow us to set the mean of the perturbations 6p to zero, the redefined background po is 
inhomogeneous in this case, in contradiction with our definition of the background density 



Po in equation (10). Therefore, we conclude that homogeneity in the mean, isotropy in the 



mean and zero mean are all equivalent. 

Zero mean Homogeneity in the mean Isotropy in the mean. (12) 

Homogeneity and isotropy in the variance also have important implications [1]. If a 
random field is stationary in the variance, its two point function in momentum space has to 
be proportional to a delta function, 

{A4>{k,)A^{h)) = {2nfS{h + (13) 

and if the variance is isotropic, the power spectrum can only depend on the magnitude 
of fc, 

VS)=mk). (14) 
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Hypothesis 


Mean 


Variance 


Zero 


Homogeneous 


Isotropic 


Ho 


yes 


yes 


yes 


Hi 


yes 


yes 


no 


H2 


yes 


no 


no 


Hi 


no 


yes 


yes 


Ha 


no 


yes 


no 


H5 


no 


no 


no 



TABLE I: The six possible different combinations of statistical properties of the primordial per- 
turbations. We are concerned here with the mean and variance alone. 



Based on the equivalences (|8j) and (12), there are hence six possible different combina- 
tions of the statistical properties of the primordial perturbations, which we list in table 
[T| Hypoth esis Hq describes the standard assumption that underlies most analyses of cos- 
mological perturbations, and case Hi describes what is usually known as a violation of 
statistical isotropy. In this article we focus on violations of the zero mean hypothesis, cases 
H3 through H5. Our goal is to test the standard assumption Hq against one of its non-zero 
mean alternatives. 



II. TEMPERATURE ANISOTROPIES 

At present, the arguably cleanest and widest window on the primordial perturbations 
in our universe stems from the temperature anisotropies observed in the Cosmic Microwave 
Background Radiation (CMB). Hence, if we want to test whether cosmological perturbations 
have zero mean, we need to explore how these temperature anisotropies are related to the 
random fields that we have considered in the introduction. 

A. Harmonic Coefficients 

In a homogeneous and isotropic universe, different Fourier modes of cosmological pertur- 
bations evolve independently in linear perturbation theory. Hence, we may always assume 
that the temperature anisotropies (of primordial origin) observed at point x in the direction 
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n are given by 

—r{k,n)me''-', (15) 

where 0(fc) are the Fourier components of a random field at a sufficiently early time, and 
T is a transfer function whose explicit form we shall not need. Say, in a standard ACDM 
cosmology we have = $, where $(a;) is the primordial Newtonian potential, which, because 
of the absence of anisotropic stress, also equals ^(a;). Due to the linear relation between 
temperature anisotropics and primordial perturbations, it immediately follows that zero 
mean in the primordial perturbations implies zero mean of the temperature anisotropics. 

For many purposes, it is more convenient to represent functions on a sphere, like the 
temperature ffuctuations, by their spherical harmonic coefficients 

fim = J d'^n f{n)Yt>min). (16) 

Throughout this article we work with real spherical harmonics 1^^? whose properties are 
summarized in appendix A. To calculate the spherical harmonic coefficients of the tempera- 
ture anisotropics aim = STim we note that because linear perturbations evolve in an isotropic 
background (by definition), the transfer function T(/c, h) can only depend on the two scalars 
k and k ■ n. Hence, we may expand the latter in Legendre polynomials P^, 

r{k, n) = r(fc, k-n) = J](2^ + l){-iY%{k)P,{k ■ n), (17) 



with real functions Te{k). Substituting then equation (17) into (15), setting x = 0, and using 
the addition theorem for (real) spherical harmonics in equation ( A5 ) we get 

/d^k - 
j^re{k)myim{k)- (18) 

Clearly, if primordial perturbations have zero mean, so do the spherical harmonic coefficients 
of the temperature anisotropics: 

= ^ (de^) = 0. (19) 

In particular, a violation of the condition (dim) = would thus imply a violation of statistical 
homogeneity. 

Later we shall also need to know the covariance of the temperature multipoles, which 



follows from equation (18). If the random field is homogeneous and isotropic in the 



variance, the covariance matrix of the multipoles has elements 

/dk 
-j^'P^{k)%{kY. (20) 

Recall that for arbitrary / we define 

^flm = fim — {fern}- (21) 

Thus, 6 denotes departures from a homogeneous and isotropic background, whereas A de- 
notes deviations from the ensemble mean. In the following, we drop the hat from random 
variables and fields. 



B. Foreground, Noise and Masks 

Unfortunately, the temperature anisotropics we actually observe in the sky are not entirely 
of primordial origin. They are a superposition of primordial anisotropics ST and foreground 
contributions, such as dust emission and synchrotron radiation from our very own galaxy. 
Appropriate foreground templates allow the WMAP team to eliminate foregrounds in some 
regions of the sky [3j, but the cleaning procedure does not completely remove foreground 
contamination along the galactic disc. It is thus necessary to subject these maps to additional 
processing. 

Since the actual temperature measurements involve a convolution with the detector beam 
B, and also include detector noise A^, we model the temperature anisotropics in a smoothed, 
foreground-reduced map by 

5T^ap = K*[B*{6T + F) + N], (22) 

where K is the smoothing kernel, the star denotes convolution, and F represents the residual 
foreground contamination. We assume that the smoothing kernel and the detector beam are 
rotationally symmetric. This is actually not the case for the WMAP beam, but it should 
be a good approximation at the scales we are going to consider. In that case, in harmonic 
space, the convolution acts on the spherical harmonic coefficients simply by multiplication, 
say, 

iB*f)em = B,fem. (23) 

In order to remove the residual foregrounds F, the contaminated sky regions have 
to be masked out. Let M(n) be the corresponding mask, which is defined by 
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M{n)[K * B * F]{n) = 0. Then, by construction, the masked sky STm does not contain 
foregrounds, 

6TM{n) = M{n)6T^^p{n) = M{n) [K * B * 6T + K * N] (n). (24) 

It proves then useful to define an hypothetical "unmasked" smoothed sky map which is free 
of foregrounds, but contains the effects of noise and detector beam, and whose multipoles 
are hence given by 

bim = KeBettim + KiNim- (25) 

The multipole coefficients of the masked sky 5Tm are obtained by multiplication with an 
appropriate convolution matrix. If Mim denotes the (real) spherical harmonic coefficients of 
the mask, it is easy to show that the elements of the convolution matrix are given by 

Mi^rm/2m2 = y2 ^=^(^1' "^15 ^' "^5 ^2, m2)Mem, (26) 

V 4:TT 

Im 



where D is defined in equation (A6). In particular, the spherical harmonic coefficients of 
the masked sky are given by 

Qm = {STM)em = ^ ^im,im ^Im- (27) 
Irh 

In practice, we need to work with finite matrices, so we restrict our attention to a finite 
range of multipole values, < £ < ^max- In particular we assume that M is a square matrix. 

The WMAP team has found that detector pixel noise is well described by a Gaussian 
distribution with zero mean [2], which implies that 

(Qm) = XI ^irnM Bl (o^- ) . (28) 
tfh 

Therefore, the masked temperature anisotropics have zero mean if the primordial 
anisotropics do. According to the WMAP team |2], the noise variance is inversely pro- 
portional to the number of times point each pixel is observed. This is not the same for all 
pixels, but it is is fairly isotropic (see figure [T]). Hence, assuming a constant N^hs^ and that 
the noise in different pixels (of area A) is uncorrelated, with variance a^/A^obs, we find 

(29) 

where A/q = Ao-f^/Nohs- In any case, at the scales we are interested in, the contribution of 
the noise to the variance of the masked temperature anisotropics is subdominant. This is 
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FIG. 1: The variance of the noise in the temperature anisotropies is inversely proportional to 
the number of times each sky pixel is observed, A'obs(^)- The figure shows a plot of the "power 
spectrum" Ne = Ylm(.^ohs)irn{N~^Jijn in units of the monopole iVo- Weighted by ({i + 1), 
this captures the degree of anisotropy of the noise variance on angular scales ~ 180°/£. 



however not crucial for our analysis, which simply assumes that the noise satisfies equation 
(29), without reference to the actual magnitude of A/q- 



C. Hypotheses 

In order to test whether the primordial temperature anisotropies have zero mean (which 
follows from (0) = 0) we need additional information about the distribution of the harmonic 
coefficients aim- At this point, there is no evidence for non-Gaussian primordial pertur- 
bations so we assume that the latter are normally distributed. In order to uniquely 
characterize their distribution, it suffices then to consider their variance. Among the hy- 
pothesis with zero mean in Table [T| Hq is the one that underlies most of our analyses of 
structure. We shall therefore adopt Hq as our null hypothesis. If Hq is true, then, according 



to equation (20), the variates Aagm = ciim are independent and have a common variance for 



the same values of Therefore, the standard assumption Hq can be cast as a precise form 



of the distribution of the temperature multipoles a^m, which follows from equations (19) and 
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Ho: Primordial perturbations are normally distributed with zero mean, homogeneous and 
isotropic variance =^ 



dP{air. 



exp 



2C, 



da. 



(30) 



Note that equations (20) and (29), together with the definition (25), imply that the covari 



ance matrix of the unmasked, foreground-reduced temperature anisotropies is also diagonal, 



m.2 ■ 



(31) 



Clearly, if the null hypothesis Hq does not appropriately fit the data, we won't be able to 
determine whether this is because temperature fiuctuations are non-Gaussian, non-isotropic, 
non-homogeneous, or simply because we used the wrong power spectrum. We need to analyze 
the data in the face of an alternative hypothesis, namely, that primordial perturbations do 
not have zero mean. Among all the cases with non-zero mean in table [T| the minimal 



deviation from Hq is hypothesis H^, which also leads to the covariances (20). Therefore, we 
choose as alternative 

H^: Primordial perturbations are normally distributed with non-zero mean and homoge- 
neous and isotropic variance =^ 



dP{air, 



1 



exp 



1C, 



da 



(32) 



In this case, the covariance of the unmasked, foreground-reduced temperature anisotropies 



is again given by equation (31) 



We test Hq against H^. What singles out our test is the ability to examine the zero mean 
hypothesis against the alternative hypothesis of non-zero means. Indeed, the only difference 
between Hq and H^ lies in the assumptions about the mean of the perturbations. Without 
the alternative hypothesis, we would be simply conducting a goodness-of-fit test. 

Mathematically it is certainly sensible to postulate hypothesis H^, but the reader may 
wonder whether H^ is also physically reasonable. In fact, we think it is. Suppose for in- 
stance that primordial perturbations are created during an infiationary period in a slightly 
inhomogeneous universe (after all, if infiation is supposed to explain cosmic homogeneity, 
it should start with an inhomogeneous universe.) If we regard these small inhomogeneities 
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as first order perturbations, in linear perturbation theory the properties of the created 
perturbations — vacuum fluctuations of an appropriate field — only depend on the homoge- 
neous and isotropic background. Hence, the resulting seeded perturbations turn out to be 
homogeneous and isotropic in the mean and the variance, as in the conventional case, but 
they have to be added on top of the already existing initial inhomogeneities. In fact, similar 
ideas have been already discussed in the literature [5H7]. 



III. TEST STATISTIC 



If we just happened to know the temperature multipoles a^m (or their foreground-cleaned 
counterparts bim), a test of hypothesis Hq against its alternative H3 would be straight- 
forward. Under the null hypothesis, for fixed i, the variables a£m form a set of independent, 
normally distributed variates with zero mean and common variance C^. The standard and 
time- honored way to test the latter involves Student's t statistic, 

t^^EU^, (33) 

where 

ai = 21^1 ^^'^ "^^ ^ ^ X^(a^m - ^if (34) 

m m 

are, respectively, unbiased estimators of the mean and variance of the distribution. Under 
the null hypothesis, t follows Student's distribution with u = 2£ degrees of freedom, while 
under the alternative hypothesis, its square is distributed like a ratio of non-central chi- 
squares (more about this below.) Note that we do not need to make any assumption about 
the actual values of the Ce in order to know how t is distributed under the null hypothesis. 
As we mentioned above, though, it is not possible to subtract part of the galactic con- 



tamination, so we are forced to work with the masked sky in equation (24). While masking 



preserves the property of zero mean, equation (28), it does not preserve the diagonal form 
of the covariance matrix. Hence, one of the key assumptions of Student's test is lost. To 
bring the problem back to the realm of Student's t, we shall impose additional symmetries 
on the problem. 

On large angular scales, the main source of foreground contamination stems from the 
galactic disc, which can be covered by a mask that spans galactic latitudes in the range 
1^1 ^ 15°. Let us hence assume that the mask is symmetric under rotations around the 
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galactic 2;-axis. Under such rotations, the real spherical harmonics transform according to 



equation (A9). Hence, rotational invariance implies = for m 7^ 0. When we substitute 



the last relation into equation (26) we find, using the results in the appendix. 



Me-^mi/2m2 ^ ^mim2 ^ud M^^ (35) 

In other words, the mask matrix is diagonal in m space, and we can basically restrict our 
attention to m > 0. Of course, a mask with these symmetry properties cannot eliminate 
all sources of foreground contamination (primarily point sources). We shall address this 
problem by restricting our analysis to large angular scales, for which the contribution of 
point sources is small. 

For notational simplicity let us call M(m) the (symmetric) matrix with elements Mii,rn/2mi 
and let us collect all the multipoles f^m for fixed m into a single vector 

— * 

/m = (/om, /im, • • • , /wm) , with /^^ = for £ < |m| . (36) 



In this notation then, M(m) = M(_m), and equation (27) reads 

C„ = M^rn)bm- (37) 

Suppose now that we find a positive integer mmax < ^max and a vector v such that 

M(m)V = V, m= -m^ax, • • • , "^max- (38) 

In other words, suppose that we find a sky Vim with no components along the contaminated 
region, such that Vim = Vi for all values of m between — mmax and mmax- If such a vector 
exists, it must clearly have vanishing components for i < m-aiax-, since |m| cannot exceed £. 
Under these conditions then, the 2mmax + 1 variables dm = v- Cm (with |m| < mmax) satisfy 

dm = v-bm, (39) 



where we have used equation (38) and M(m) = M^^y In particular, these variates do not 
contain galactic residual backgrounds, because they can be constructed from a masked sky, 
they have zero mean under the null hypothesis, 

^max 

(dm) = Yl ^^(^irn), (40) 
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and, because of equation (31), they have covariances 

{AdmiAdm^) = O-^f^mima (l^^ll < "^max, |"^2| < "^max), (41) 

where 

■^max 

a'^ Yl Kl{BlC, + N,)vl (42) 
The off-shot of this construction is not only that the variables dm are uncorrelated (and 



hence independent), but also that their variance, equation (42), is the same for all of them. 
Note that, by construction, the variables dm only contain temperature multipoles him in the 

range £min < ^ < ^max, where £min = "^max- 

A. Test Statistic 



Equation (41) states that the 2mmax+ 1 variates dm < mmax) form a set of normally 
distributed independent variables, with common variance o"^. They are thus statistically 
analogous to the primordial temperature multipoles aim-, which are independent and have 
the same variance for fixed I. Under the null hypothesis Hq the dm have zero mean, and 
under the alternative hypothesis their mean is generically non-zero. Because the dm 
are linear combinations of the normally distributed a^mi the distribution of the former is 
also Gaussian. It is hence natural to choose Student's t as test statistic, although, for later 
convenience, we shall actually work with its square, 

e ^ + (43) 

where 

E ^'-^ E ^drn-df. (44) 

^ ' ' ''max ~r -1- i-max , , . 

Intuitively, the nature of the test statistic is clear: Up to factors that involve mmax, is 
just the square of the ratio of sample mean to sample standard deviation. We would expect 
this ratio to be small if the variables indeed have zero mean, and large if they don't. 

One of the keys of our test statistic is that we know its distribution both under the null 
and the alternative hypothesis. The identity 'Ylmi^rn)'^ ~ 2^maxS^ -l- (2minax + 1)^? implies 
by an extension of Cochran's theorem (section 15.20 in [8j and section 35.7 in [9]) that 



the numerator and denominator of equation (43) are independent variables, both under Hq 
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and H-^. Again by the same extension of Cochran's theorem, the numerator (divided by cr^) 
follows a non-central chi-square distribution with vi = 1 degrees of freedom, and non-central 
parameter 

Ai = (2m^ax + l)^, (45) 

whereas 2mmax'S^ (divided by a^) follows a non-central chi-square distribution with 
V2 = 2mjnax dcgrccs of freedom, and non-central parameter 

A,= J: m_M, (46) 

|m|<mmax 

The statistic is thus a ratio of non-central chi-squares divided by their respective number of 
degrees of freedom. The latter follows a doubly non-central F distribution, with probability 
density given by (section 24.30 in [TU] ) 

(47) 

where B is the beta function. If all the ensemble means are equal, fi = (dm), A2 vanishes, 
and the distribution of simplifies to a non-central F-distribution with degrees of freedom 
1/1 = 1 and z/2 = 2mmax, and non-central parameter A = Ai. Under the null hypothesis Hq 
the non-central parameter is A = 0, and the distribution reduces to a central F with u = 1 
and 1^2 = 2mmax5 which is just the square of Student's t distribution. 

We carry out a one-sided test of the null-hypothesis Hq at significance a (say, a = 5%) 
by rejecting the null hypothesis if t"^ is larger than (t^)a, where (t^)a is the a-point of the 
central F^ui = 1, z/2 = 2mmax) distribution, 

a = P{t^>{f)a\Ho). (48) 

This amounts to a two-sided test of Ho using Student's t, in which we reject the null hy- 
pothesis for sufficiently large deviations (of either sign) of t from zero. In order to evaluate 
the power of this test, 1 — /3, we need to determine the probability of rejecting the null 
hypothesis when the alternative hypothesis is true, 

l-P = P{t^>{t^)a\H^). (49) 

The power is a function of the hypothetical standardized non-zero ensemble means 
{dm) /(J- In the absence of any particular model, and for the purpose of illustration, we shall 
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tidier 



FIG. 2: Power curves as for a test with a = 5% significance and mmax = 100 as a function of 
a standardized common mean. The solid curve shows the power of a test based on Student's t, 



equation (43). The dashed curves show the power of a test based on the statistic (55) for A; = 0, 25 



and 50. Clearly, Student's t-test is more powerful than any of the tk tests. (Note that we can restrict 
our attention to A; < 50, since for higher k values the statistic has the same distribution as 1/i^, 
with k < 50.) For different choices of parameters, the power curves have the same qualitative form. 
Evidently, as the number of degrees of freedom 1^2 = 2mmax increases, so does the power of any of 
these tests. 

consider the simple (though perhaps somewhat umealistic) case of equals means: fia = {ciem)- 



Using equations (39) and (25) this translates into 



fid 



{dm) = Zfj,a, where Z = ^ K^B. 



eVe. 



(50) 



£=m„ 



In general, we may regard fia as a measure of the order of magnitude of the mean of the 
primordial temperature anisotropics in the corresponding multipole range, even if the means 
are not common. We plot the power of the test as a function of /id/a, for mmax = 100, in 
figure [2j We see for instance that in a test with mmax = 100, if the mean of our variables is 
about 0.3 times their standard deviation (or larger), the t"^ statistic will fall in the critical 
region almost with certainty. 
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B. Confidence Intervals 



The value of the statistic tells us not only whether the null hypothesis holds, but also 
how far from zero the mean may be if the alternative hypothesis is the correct one. Let 
t\ denote the a-point of Student's distribution, and, for simplicity, let us assume again 
that all the means of the primordial anisotropies have a common value Then, since the 
statistic 

= ^2 (51) 

is distributed like under the alternative hypothesis H^, we can write -P(t^ < ^a) = 1 — «• 
Following the conventional approach to classical interval estimation, we can cast the last 
relation as a confidence interval that formally involves /x^, 

P (d - <^^d<d+ '-\ ) =l-a. (52) 

This is a frequentist interval: If we repeat the same procedure to derive the confidence 
interval times, as approaches infinity our interval will contain the true mean in 
(1 — a)N cases. 

C. Extension to Several Statistics 

We can also extend this analysis to a set of several independent statistics, designed to 
probe the temperature anisotropies at different angular scales. Suppose for instance that we 
find not one, but a set of n different vectors ■u'-*^ {i = 1, . . .n) that satisfy the set of equations 



(38), with mmax = "^max and £max = ^max- Then, for fixed i, the variates dm = "W^*^ ■ still 



have a common variance, as in equation (41). We can therefore define a set of n different 
statistics simply by replacing J* by cT^*^ and mmax by mmlx in equations (43) and (44). But 
if the ranges mmax < ^ < ^max are disjoint, the vectors w*^*-* do not have any common element, 
and the variables J^*^ and are also uncorrelated for i ^ j. In that case, the different t"^ 
statistics are mutually independent, and probe the temperature anisotropies in the disjoint 
multipole ranges 

i^,^<i<i^^, with £!:L = m(L. (53) 
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D. Alternative Tests 



A perhaps undesirable property of our test statistic is that is not a scalar under 
rotations. Since the mask breaks rotational symmetry anyway, this is not a problem by 
itself. Nevertheless, the mask does preserve the symmetry under azimuthal rotations, so it 
would be natural to demand at least invariance of our statistic under this unbroken subgroup. 

Under an azimuthal rotation by an angle 0, the variates transform as in equation 



(A9), with a^rn replaced by dm- Clearly, the statistic t in equation (43) does not remain 



invariant under such rotations. But the transformation law (A9) immediately suggests how 
to address the problem. Indeed, the variables 

(ll = dl ql = \{dl + dl^) (m>0) (54) 

are invariant under azimuthal rotations by construction, and they also share the same vari- 
ance. Hence, any ratio of the sum of two disjoint subsets of these squared variables is 
distributed like a ratio of independent (eventually non-central) distributions. 
Consider for instance the set of statistics 

k 

y q' 

,2 '"rnax m=0 

m=fc+l 

where k is an arbitrary parameter. As before, under the alternative hypothesis the 



probability density of t\ is given by equation (47), with degrees of freedom and non-central 
parameters given by, respectively, 

k 



m=l 

^ TTlniax 

^2 = ^max - k, A2Cr^ = 2 ^ {{dm}'' + {d-m)') ■ (57) 

m=k+l 

In particular, under the null hypothesis Hq, t\ follows a central F distribution with z/i and 
1^2 degrees of freedom. 



Using equation (47) we calculate the power of the set of alternative tests based on the 
statistic (55). Assuming that all the means are common, as above, we find the power curves 
in figure[2j Inspection of the figure quickly reveals that the test based of Student's statistic 
is uniformly more powerful than any test based on a t\ statistic. This is no coincidence at all; 
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Student's t test is widely employed because of its optimal properties (see example 23.14 in 
|lUj.) Therefore, in this article we just focus on Student's t^. As long as we stick to a single 
(random) orientation of the sky, our results have a straight-forward statistical interpretation, 
since all we need to know is how the test statistic is distributed for an arbitrary (but fixed) 
sky orientation. In addition, the non-scalar nature of the statistic may help us to identify 
that area of the sky eventually responsible for a violation of the zero mean hypothesis. To 
conclude, we should also point out that Student's t test is known to be robust to departures 
from normality, at least for independent variables drawn from the same distribution (section 
31.3 in [in]). 

IV. DATA AND ANALYSIS 

Our data analysis pipeline consists of four main steps. First, we construct an appropriate 
mask to eliminate residual galactic foregrounds. Then, we identify a vector v that belongs 
to the range of all the mask matrices Mm for |m| < mmax- We degrade the cosmic microwave 
maps to lower resolution, and apply our test statistic to these maps. Finally, we check for 
an eventual residual contamination in our results. This section lists the details of each of 
these steps. The reader not interested in technical details is welcome to skip this part and 
jump to the next section for the actual results. 

Mask 

As mentioned above, in order to eliminate galactic contamination, and preserve azimuthal 
symmetry at the same time, we construct a mask invariant under rotations along the galactic 
2;-axis. Our starting point is a HEALPix^ pixelization of the sphere with A'^side = 64. We set 
all pixels in the mask with galactic latitude |&| < 20° to zero, and all the remaining pixels to 
one. The effective area covered by the mask is 66% of the full sky. We label the components 
of the pixelized mask by M{ni), where i runs over all the A^pix = 12 x 64^ = 49152 pixels of 
the mask. This particular mask is in fact also symmetric under parity, but we do not make 
explicit use of this symmetry. 

^ http://healpix.jpl.nasa.gov/ 
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Vector V 



In order to find a v that satisfies equation (38), we look for a common solution of the set 
of equations 

(1 - M(m)){; = 0, m = -mmax, • • • , mmax- (58) 
Since M(m) is the mask matrix, the linear operator 1 — M(m) gives the components of the 



vector along the contaminated galactic region. Hence, equation (58) states that the vector 
V should have a vanishing component along such region. 

In order to find the components of it is numerically more convenient to work in real 
space. We fix the values of £max and mmax and calculate a matrix D whose elements are 
defined by 

Du = {l-M{ni)) J2 Yemif^r), (59) 

|rrt|<m,max 

where i runs over all pixels in a HEALPix pixelization of the sphere with A^^sidc = 64, and 



i = rrijy^ax, • • • , ^max- Then, the set of equations (58 ) reads Da ve = 0, or 



Dv = 0. (60) 



We find an approximate solution of equation (60) by singular value decomposition. 



D = J2Ua^aV;[. (61) 

a 

—* 

Here, the Ua is a set of A^pix-dimensional orthonormal vectors, the Sq, are the singular values 
(arranged in order of decreasing magnitude), and the is a set (£max — "^max + l)-dimensional 
orthonormal vectors. We choose the vector v to be the last right singular vector, that is, 
V = Va, with a = ^max " ""^max + 1- We label the corresponding singular eigenvalue Siast- 
Because the vectors Ua and Va are orthogonal, the singular value Siast is the norm of D v, 



2 



EE 



Dw,] (62) 



In general, this singular value is non-zero, so our solution of equation (60) is not exact but 
only approximate. Modulo a normalization factor, the value of Siast is then an indicator 
of the potential degree of contamination, i.e., the overlap between our vector v and the 
contaminated galactic region. The latter typically increases with increasing mmax, since the 



number of non-zero elements of freely available to solve equation (60) decreases with 
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FIG. 3: The logarithm of the absolute value of the sky defined by equation (63), for ^max — 

212 



and minax = 177. Those regions of the sky with the largest values are those that are more heavily 



weighted in our statistic, as implied by equation (39). As seen in the figure, the galactic region is 



basically excluded from our analysis. For other choices of ^max and mmax the structure of the sky 
is the same, as long as the singular eigenvalue Sjast remains sufficiently small. 

increasing mmax- Because the power of the t"^ test increases with the number of degrees 
of freedom, z/2 = 2m^a_^, we choose the maximum possible value of m-max for which Siast, 
divided by the norm of the mask times the norm of sky encoded in v, remains under 5 ■ 10~^. 
Indeed, the resulting vector can be represented visually, by defining the sky 

I'^l ^ ^max; '^max ^ ^ ^ ^max 

(63) 

0, otherwise, 

which captures those regions of the sky that enter our statistic. As an example, the corre- 
sponding real space sky for ^max = 212 and mmax = 177 is shown in figure [3j 

The same process can be repeated for different choices of ^max and mmax- If the corre- 



sponding intervals (53) do not overlap, we can use the resulting set of vectors v to construct 
a set of mutually independent t"^ statistics, as explained in subsection III C 
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Maps and Processing 

We analyze the seven-year (version 4), full resolution, foreground-reduced Q2, V2 and Wl 
differencing assembly cosmic microwave anisotropy maps provided by the WMAP mission.^ 
Since these maps probe different frequencies of the microwave spectrum, a dependence of 
our results on the particular map would indicate non-thermal foreground contamination. 
We expect the latter to be smallest for the W and V maps, and largest for the Q map. We 
thus take the Wl assembly to be our fiducial map, and keep the V2 and Q2 assemblies just 
for comparison. 

The WMAP has subtracted the dipole and primordial monopole from the three differenc- 
ing assemblies, and the latter have been smoothed with a Gaussian kernel K of FWHM = 1°. 
The sky maps are expanded into (real) spherical harmonics, and band-limited to a maximum 
multipole value (i = £max- The different values of £max are chosen iteratively to cover the 
multipole range < £ < 212 with non-overlapping intervals. We choose i = 212 as the 
absolute maximum for i because we expect point sources to significantly contaminate the 
temperature anisotropies at higher multipoles. Note that it is not necessary to mask the 
sky prior to processing, since the vector v has no components along the galactic region by 
construction. 



Contamination 



There are two main possible sources of systematic errors in our analysis: Galactic con- 



tamination due to an insufficiently resolved mask or an imperfect solution of equations (58), 
and point source contamination due to unmasked high-latitude point sources. In order to 
estimate both we basically follow the same approach. 

Let us assume that the values of £max and mmax have been fixed. To estimate the amount 
of galactic contamination, we subtract from the cosmic microwave maps the portion of the 
sky covered by the WMAP seven-year temperature analysis mask and run the resulting sky 
map through the analysis pipeline described above. The change in the P-value of the 
statistic is then a measure of galactic contamination. For imax = 212 and mmax = 177 for 
instance, the change in the t"^ statistic after subtraction of the galaxy is less 0.01% for the 



^ Available at http://lambda.gsfc.iiasa.gov/. 
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Wl map. 

Similarly, to estimate the amount of point source contamination we construct a sky map 
of temperature anisotropics from the point source fluxes listed in the WMAP point-source 
catalog [llj. We subtract the point source map from the cosmic microwave background 
and run the resulting sky map through our data analysis pipeline. The change in the 
corresponding P-value of the statistic is then a measure of point source contamination. 
Certainly, there are unresolved point sources that the WMAP catalog does not contain, 
but the contribution of these sources is small compared to the contribution of the actually 
detected sources that we are not able to mask. Say, if we choose £max = 212 and mmax = 177, 
the change in the statistic after point source subtraction is less 1% for the Wl map. Point 
sources do not typically have thermal spectra, so an inspection of our results for different 
differencing assemblies gives us yet another handle on such contamination. 

An alternative way to estimate point source contamination involves the ratio 

R, ^ (64) 

where the sim are the multipoles of the temperature map constructed from detected point 
sources alone, and the aim are the spherical harmonic coefficients of the analyzed sky (say, 
the Wl map). The sums in only run over m, because our statistics are sensitive only to 
a relatively small window of multipoles in i space. As shown in figure |4| R^ remains below 
1% up to £ ~ 256, which is in broad agreement with our direct estimate of point source 
contamination using the statistic. Because point source contamination decreases with the 
frequency of the map [TT], the Wl differencing assembly is less contaminated than the other 
two assemblies. This is why we take Wl to be our fiducial map. 

Note by the way that none of the procedures described above is an actual attempt to 
subtract galactic or point source contamination. Instead, it is just a way to estimate the 
contribution of the unmasked foregrounds to the P-value of our statistic. 

V. RESULTS 
A. P- values 

Our results are summarized in table [TT] and represented graphically in figure [5j There are 
two multipole ranges in which the P-value of our statistic is smaller than 5%. In the first 
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0.02 r 



0.01 
0.008 - 

0.006 
0.004 - 

0.002 - 




FIG. 4: Power in detected W-band point sources relative to actual cosmic microwave anisotropies, 



equation (64). The point source contribution is subdominant at low multipoles, and reaches 1% at 
i w 256. 

case, for £ = 61 — )■ 86, the P-value across the three differencing assembhes remains under 
1.5%, so it does not seem that this result is due to residual foregrounds alone. In particular, 
in this multipole range point source contamination is negligible. In the second case, for 
i = 177 — )■ 212, the P-value of the statistic for the Q2 map is normal, so we may tentatively 
attribute the difference in the values of among maps to foreground contamination. This 
explanation however is somewhat problematic, because we expect foregrounds to make the 
actual value of t"^ less likely, and because contamination is stronger in the Q2 map, whose 
t"^ is normal. 

Can we speak then of statistically significant evidence against the null hypothesis? To 
answer this question, we need to realize that we have constructed a set of n = 8 independent 
tests of the null hypothesis, one for each multipole range. Hence, if we would like the 
P-values of all individual statistics to be larger than a with probability 1 — atot (under the 
null hypothesis), we should choose the size a of each individual test to satisfy 



1 - ab 



■ot 



:i-ar. 



For n = 8 and 1 — atot = 95%, this yields a = 0.64%. None of the P-values in table 11 
low. 



(65) 



IS as 
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Ran 


ge 




Q2 




V2 


Wl 


f ■ { 


max 




P-value 




P-value 




P-value 


1 


18 


1.957 


29.7% 


2.367 


26.4% 




95 8% 

ZirJ.O /O 


19 


38 


0.269 


60.7% 


0.208 


65.1% 




65 7% 

UO. 1 /o 


39 


60 


1.963 


16.5% 


2.212 


14.1% 




i 1 . u /o 


61 


86 


6.341 


1.3% 


6.431 


1.2% 


7.397 


0.7% 


87 


112 


0.699 


40.4% 


0.275 


60.1% 


0.639 


42.5% 


113 


142 


0.473 


49.2% 


0.816 


36.7% 


0.406 


52.5% 


143 


176 


0.029 


86.5% 


0.012 


91.4% 


0.025 


87.5% 


177 


212 


2.582 


10.9% 


4.615 


3.2% 


4.091 


4.4% 



TABLE II: The value of the statistic and the corresponding P-value (the probability for the 
statistic to be larger than the value we observe.) The range column indicates the multipoles that 
enter the calculation of the statistic (^min < ^ < ^max)- We perform the same analysis for three 
different differential assembly maps, Q2, V2 and Wl. The P-value is anomalously small (less than 
5%) for all three maps just in one multipole range: 61 to 86. 

We reach similar conclusions by calculating the value of a statistic often used to combine 
the results of multiple independent tests of a single hypothesis: Stouffer's weighted Z test 
|12j . Let Pi be the P-value of our test in the i-th multipole range, and let Zi be the Pi 
point of a standard normal distribution. Then, the variate 



(66) 



where the Wi are the weights assigned to each test, follows a normal distribution with zero 
mean and unit variance. We weigh each test by the number of degrees of freedom of the 
corresponding test, Ui = 2mmax. For the eight P-values listed in table |ll[ and the Wl 
differencing assembly map, the value of Stouffer's statistic is 

Z = 1.21, (67) 

clearly under two sigma away from the mean of the standard normal distribution. 
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FIG. 5: Visual representation of table |nj We plot the P-values of the statistic under the null- 
hypothesis for different multipole ranges and different differencing assemblies (blue for VKl, red for 
y2, green for Q2.) For reference, the horizontal line marks 5% probability. Clearly, the value of 
in the multipole range 61 < ^ < 86 is anomalously small. 

B. Goodness of Fit 

The reader may also wonder how much better the data are fit by a distribution with 
non-zero mean. In order to find that out, we calculate first an effective by extremizing 
the likelihood under both the null and the alternative hypothesis. 



(68) 



Since the variates dm are normal and independent, the likelihood is simply a product of 
Gaussian density functions. Therefore, sample mean sample variance respectively are the 
maximum-likelihood estimators for the population mean and the population variance. 
The difference Ax = Xo ~ X3 is a measure of how much the fit improves when we relax the 



assumption of zero mean. Because of equation (68), this difference is a monotonic function 



of the ratio of maximum likelihoods under Hq and i/3, which also happens to be a monotonic 
function of the statistic (example 24.1 in [ID]). For illustration, we list the corresponding 



values of A^^ in table III Clearly, since we have an additional parameter to fit the data, we 
expect a better fit under Hi,. To correct for the presence of additional parameters, several 
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•^max 


dof Ax^ 


AAICc 


ABIC 


1 


18 


o 
o 


9 /in 




1.31 


19 


38 


oy 


n 91 


-2.02 


-3.46 


39 


60 


7Q 


9 ^9 


0.41 


-1.85 


61 


86 


123 


7.24 


5.17 


2.43 


87 


112 


175 


0.64 


-1.40 


-4.52 


113 


142 


227 


0.41 


-1.63 


-5.02 


143 


176 


287 


0.02 


-2.00 


-5.63 


177 


212 


355 


4.08 


2.06 


-1.79 



TABLE III: Comparison of fits to tlie Wl data under the null and alternative hypotheses. Positive 
values indicate that the alternative hypothesis is a better description of the data. The change 
in simply shows that the temperature anisotropics are better fit in the presence of additional 
parameter. The AICc corrects for the latter, and indicates strong evidence for the alternative 
hypothesis in the multipole range 61 — t- 86. The BIC heavily penalizes the presence of an additional 
parameter, but still indicates substantial evidence for a non-zero mean in that multipole range. 



model selection measures have been proposed in the literature [To]. In table III we list 
the difference in the corrected Akaike information criterion (AICc) and the difference in 
the Bayesian Information Criterion (BIC). From a Bayesian perspective, the difference in 
information criteria A is a measure of relatively model likelihood 

C{Ho\d) ( A\ 

exp -- . (69) 



This equation allows us then to interpret A/2 as a number of standard deviations. Again, 
a distribution with non-zero mean seems to be a better model to describe the data in the 
multipole range 61 — t- 86. But as we emphasized above, this is relatively likely to happen if 
multiple ranges of multipoles are considered. 

C. Confidence Intervals 

The actual values of the test statistic for our choices of £max and mmax also allow us 
to place the first limits on the magnitude of an eventual common mean of the primordial 
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Range Q2 V2 Wl 



^min 


^max 


/^min 


/^max 


/^min 


/^max 


/^min 


A'max 


VC~£ 


1 


18 


-7.061 


3.596 


-6.989 


3.308 


-6.962 


3.244 


7.20 


19 


38 


-0.275 


0.464 


-0.283 


0.448 


-0.278 


0.438 


2.87 


39 


60 


-0.335 


0.058 


-0.336 


0.049 


-0.347 


0.039 


1.92 


61 


86 


-0.278 


-0.033 


-0.278 


-0.034 


-0.286 


-0.045 


1.46 


87 


112 


-0.156 


0.063 


-0.137 


0.079 


-0.153 


0.065 


1.30 


113 


142 


-0.134 


0.065 


-0.140 


0.052 


-0.129 


0.066 


1.17 


143 


176 


-0.112 


0.094 


-0.106 


0.095 


-0.108 


0.092 


1.09 


177 


212 


-0.230 


0.023 


-0.251 


-0.011 


-0.249 


-0.003 


0.98 



TABLE IV: Lower and upper limits on a common mean of the primordial temperature multipoles 
in the given multipole range at 95% confidence level. Temperature units are fxK. For comparison we 
list WMAP's best-fit estimate of the binned power spectrum around the center of the corresponding 
multipole range. 



perturbations in the given range of multipoles. These limits are collected in table |IV| and 
graphically represented in figure [6j At angular scales smaller than about four degrees, the 
limits are typically one order of magnitude below the standard deviation of the temperature 
multipoles. In two cases, the confidence interval does not contain zero, which is again an 
expression of an anomalously high value of the t"^ statistic in the corresponding multipole 
range. But, as before, since these are 95% confidence intervals, the probability that all of 
them contain the true mean is only 66%. In any case, these limits should not be taken 
too literally. The assumption of a common mean is somewhat unrealistic, so these intervals 
should be rather interpreted of an order of magnitude estimate of possible deviations from 
the zero-mean assumption, even if the means of the anisotropics do not share a common 
value in the corresponding multipole range. 

VI. CONCLUSIONS 

Our results show significant evidence for a non-zero mean of the temperature multipoles 
in the range i = 61 to i = 86, at the 99.3% confidence level. Taken as a whole however, 
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FIG. 6: Limits on the value of a common mean at 95% confidence limit (in ^uK units), as in 



table IV Again, blue, red and green label the limits derived from the Wl, V2 and Q2 band maps 
respectively. The extent of the interval on the £ axis indicates the range in values of I for which 
the limit applies. For comparison we also plot the variance of the multipole components for the 
best WMAP's estimate of the binned power spectrum, it-y/C^. Note that the temperature scale is 
logarithmic, with positive and negative values on either side of the axis. 



because this range is just one among eight different multipole bins, the evidence against the 
zero-mean assumption is statistically insignificant, falling under the 95% confidence level. 

Whatever the case, the limits we have set on the mean of the primordial anisotropies in 
a set of multipole bins indicate that an eventual non-zero mean has to be about an order 
of magnitude smaller than the standard deviation of the temperature anisotropies. In that 
sense, observations constrain the mean to be small. In retrospective, we have therefore par- 
tially justified the common assumption of vanishing mean of the cosmological perturbations. 
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Appendix A: Real Spherical Harmonics 



In this article we expand functions defined on a sphere in real spherical harmonics Y^^. 
These are related to the conventional complex spherical harmonics by 



Yfm. = < 



m = 



(Al) 



V2Reyi^, m > 0. 



It follows that the real multipole coefficients ag^ and their complex counterparts Aim are 
related to each other by 



— \/2lmAe-m, m < 
Aim, m = 

\/2Re Aim, m> 0. 



Aim — < 



[ai_rn + iaim) , m < 
aira-, m = 



(A2) 



m > 0, 



where we have assumed that the function on the sphere being expanded is real. The trans- 



formation (Al) is unitary, that is, we can write 



(A3) 



with U a unitary matrix, whose matrix elements are implicitly defined by equation (Al) 
Because of the unitary transformation, real spherical harmonics are orthonormal. 



(A4) 



and they also satisfy the addition theorem 

4:71 



Pii.ni ■ ^2) = 2f I 1 ^irnini)Yemin2) 



(A5) 



where Pi is a Legendre polynomial. 

Sometimes we need to integrate over the product of three spherical harmonics. We define 

D{ii,mi; £2,^2; £3,^3) = V^TT / d^nYi^m^Yi,^m2Yhm3, (A6) 
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which clearly is totally symmetric in its three arguments. Since the real spherical harmonics 
are related to the complex spherical harmonics by a unitary transformation, this expression 
is closely related to the integral of the product of three complex spherical harmonics T>. The 
latter can be expressed as a product of Clebsch-Gordan coefficients (or Wigner symbols), so 
we have 

D{ii,mi; £2,^2] £3,1713) = mi;i2,m2\i3,m3)Ui^^^, (AT) 

mi,m2,m3 

with 



mi; £2,^21^3,^3) = \/ ^^^' ^^^_^^|^^^'^^^ ^i,0;4,0|4,0)(4,mi;£2,m2|4,m3). (A8) 

It follows then for instance that mi; £2, "^21^3, ""^s) = ^(^1, —^i]^2, ~^2\(^3i —^3)- 

Under (active) azimuthal rotations by an angle the complex spherical harmonic coeffi- 
cients transform according to Aim — ^ ^~^^'^-Aiim- Therefore, it follows from the left equation 



in (A2) that real spherical harmonic coefficients transform according to 

atm cos(m (j))atm - sin(m 0)a^_m- (A9) 
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